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ABSTRACT 

*0 ! The cross-correlation of the Ly-a forest and redshifted 21-cm emission has 

r**"" ■ recently been proposed as an observational tool for mapping out the large- 



scale structures in the post-reionization era z < 6. This has a significant 
advantage as the problems of continuum subtraction and foreground removal 
are expected to be considerably less severe in comparison to the respective 
auto-correlation signals. Further, the effect of discrete quasar sampling is con- 
' siderably less severe for the cross-correlation in comparison to the Ly-a forest 

h : 

auto-correlation signal. In this paper we explore the possibility of using the 
cross-correlation signal to detect the baryon acoustic oscillation (BAO). To 
this end, we have developed a theoretical formalism to calculate the expected 
cross-correlation signal and its variance. We have used this to predict the ex- 
pected signal, and estimate the range of observational parameters where a 
detection is possible. 

For the Ly-a forest, we have considered BOSS and BIGBOSS which are 
expected have a quasar density of 16 deg -2 and 64 deg -2 respectively. A radio 
interferometric array that covers the redshift range z = 2 to 3 using antennas 
of size 2 m x 2 m, which have a 20° x 20° field of view, is well suited for the 
redshifted 21-cm observations. It is required to observe 25 independent fields 
of view, which corresponds to the entire angular extent of BOSS. We find 
that it is necessary to achieve a noise level of 1.1 x 10 -5 mK 2 and 6.25 x 
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10~ 6 mK 2 per field of view in the redshifted 21-cm observations to detect the 
angular and radial BAO respectively with BOSS. The corresponding figures 
are 3.3 x f(T 5 mK 2 and 1.7 x 1CT 5 mK 2 for B1GBOSS. We also discuss possible 
observational strategies for detecting the BAO signal. Four to five independent 
radio interferometric arrays, each containing 400 antennas uniformly sampling 
all the baselines within 50 m will be able to carry out these observations in 
the span of a few years. 

Key words: cosmology: theory - large-scale structure of Universe - cosmol- 
ogy: diffuse radiation 

1 INTRODUCTION 

Neutral hydrogen (HI) in the post-reionization epoch (z < 6) is known to be an impor- 
tant cosmo logical probe seen in both emission and absorption. Here, the redshifted 21-cm 
emission (Furlanetto et al., 2006; Morales & Wyithe, 2009) and the transmitted QSO flux 
through the Ly-a forest (Weinberg et al., 1998; Mandelbaum et al., 2003) are both of ut- 
most observational interest. In a recent paper Guha Sarkar et al. (2011) have proposed the 
cross-correlation of the 21-cm signal with the Lyman-a forest as a new probe of the post- 
reionization era. While it is true that the emission and the absorption signals both originate 
from neutral hydrogen (HI) at the same redshift (or epoch), these two signals, however, 
do not originate from the same set of astrophysical sources. The 21-cm emission originates 
from the HI housed in the Damped Lyman-a systems (DLAs) which are known to contain 
the bulk of the HI at low redshifts (Prochaska et al., 2005). The collective emission from 
the individual clouds appears as a diffuse background in low frequency radio observations 
(Bharadwaj et al., 2001). On the contrary, the Ly-a forest consists of a large number of Ly-a 
absorption lines seen in the spectra of distant background quasars. These absorption features 
arises due to small density fluctuations in the predominantly ionized diffuse IGM. On large 
scales, however, the fluctuations in the 21-cm signal and the the Ly-a forest transmitted flux 
are both believed to be excellent tracers of the underlying dark matter distribution. Guha 
Sarkar et al. (2011) have proposed that the cross-correlation between these two signals can 
be used to probe the power spectrum during the post-reionization era. 
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The HI power spectrum can be determined separately from observations of the Ly-a 
forest (Croft et al., 1998) and the redshifted 21-cm emission (Bharadwaj & Sethi, 2001). 
The detection of the individual signals, however, face severe observational challenges. The 
Ly-a auto-correlation power spectrum is expected to be dominated by the Poisson noise 
arising from the discrete sampling of QSO lines of sight. The cross-correlation signal has 
the advantage that it is not affected by the Poisson noise which affects only its variance. 
Uncertainties in fitting the QSO continuum (Croft et al., 2002; Kim et al., 2004, 2007) pose 
another challenge for using Ly-a observations to determine the power spectrum. Astronomi- 
cal sources like the galactic synchrotron emission, extra-galactic point sources, etc. appear as 
foregrounds for the redshifted 21-cm signal. These foregrounds, which are several orders of 
magnitude larger than the signal, pose a great challenge for detecting the post-reionization 
21-cm signal (Ghosh et al., 2010, 2011). The foregrounds and systematics in the Ly-a forest 
and the redshifted 21-cm emission are, however expected to be uncorrelated and we therefore 
expect the problem to be much less severe for the cross-correlation signal. A detection of 
a cross-correlation signal shall, hence, conclusively ascertain its cosmological origin. Apart 
from being an independent probe of the large scale matter distribution, the cross-correlation 
signal can potentially unveil the same astrophysical and cosmological information as the 
individual auto-correlations. 

Cosmological density perturbations drive acoustic waves in the primordial baryon-photon 
plasma which are frozen once recombination takes place at z ~ 1000, leaving a distinct oscil- 
latory signature on the CMBR anisotropy power spectrum (Peebles & Yu, 1970). The sound 
horizon at recombination sets a standard ruler that maybe used to calibrate cosmological 
distances. Baryons contribute to 15% of the total matter density, and the baryon acoustic 
oscillations are imprinted in the late time clustering of non-relativistic matter. The signal, 
here, is however suppressed by a factor ~ Qb/Q m ~ 0.1, unlike the CMBR temperature 
anisotropies where it is an order unity effect (Komatsu et al., 2009). The baryon acoustic 
oscillation (BAO) is a powerful probe of cosmological parameters (Seo & Eisenstein, 2003; 
White, 2005). This is particularly useful since the effect occurs on large scales (~ 150 Mpc), 
where the fluctuations are still in the linear regime. It is possible to measure the angular 
diameter distance and the Hubble parameter as functions of redshift using the the transverse 
and the longitudinal oscillations respectively. These provide means for estimating cosmolog- 
ical parameters and placing stringent constraints on dark energy models. Nonlinear effect 
of gravitational clustering tend to wipe out the BAO signal, and it is preferable to avoid 



4 Guha Sarkar, Bharadwaj 

very low redshifts where this is a potential problem. However, very high redshifts too are 
not very useful for constraining dark energy models. Several authors have reported a 2 to 
3a detection of the BAO in low redshift galaxy surveys (Eisenstein et al., 2005; Percival 
et al., 2007, 2010). The possibility of detecting the BAO signal in the Ly-a forest has been 
extensively studied by McDonald & Eisenstein (2007). Several groups have also considered 
the possibility of detecting the BAO signal using redshift 21-cm emission (Chang et al., 
2008; Mao & Wu, 2008; Masui et al., 2010). 

Several QSO surveys are now being considered with the intent of measuring the BAO 
using the Ly-a forest (eg. BOSS (McDonald et al., 2005) and BIGBOSS (Schlegel et al., 
2009)). The possibility of a wide field redshifted 21-cm survey to detect the BAO is also 
under serious consideration. In this paper, we consider the possibility of studying the BAO 
using the cross-correlation signal. In Section 2. of this paper we quantify the cross-correlation 
between the Ly-a forest and the 21-cm emission, and present theoretical predictions of 
the expected signal. We have used the multi- frequency angular power spectrum Ce(Az) 
(MAPS, Datta et al. 2007) in preference to the more commonly used three dimensional 
power spectrum P{k) to quantify the cross-correlation signal. This has several advantages 
which we briefly discuss here. MAPS refers to the angular multipole i (or equivalently 
angle) and redshift interval Az which are the directly relevant observational quantities for 
both Ly-a surveys and redshifted 21-cm observations. This is particularly important if we 
wish to determine the angular scales and redshift intervals which need to be covered in 
order to detect a particular feature in the signal. The foregrounds in the redshifted 21-cm 
observations and the continuum in the Ly-a forest are both expected to have a smooth, 
slow variation along the frequency axis, and this plays a crucial role in removing these from 
the respective data. It is therefore advantageous to use MAPS which retains the distinction 
between the frequency (redshift) and angular information, unlike P(k) which mixes these up. 
Ali et al. (2008) have analyzed GMRT observations using MAPS to jointly characterize the 
angular and frequency dependence of the foregrounds at 150 MHz. Ghosh et al. (2010) and 
Ghosh et al. (2011) have applied MAPS to Analyze 610 MHZ GMRT observations, and have 
used this to characterize the foregrounds for the post-reionization 21-cm signal. In fact, they 
show that it is possible to completely remove the foregrounds from the measured CV(Az/) by 
subtracting out polynomials in Az/. Finally, the signal could have a significant contribution 
from the light cone effect, particularly if the observations span a large redshift interval. It is, 
in principle, possible to account for this in the MAPS, though we have not done this here. It 
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is, however, not possible to account for this effect in the three dimensional power spectrum 
which mixes up the information from different epochs through a Fourier transform along the 
radial direction. 

In Section 3 of this paper we quantify the imprint of the BAO feature on the cross- 
correlation signal. Here MAPS has the advantage that it allows us to separately study the 
radial and the transverse oscillations through the Av and the t dependence respectively. In 
Section 4 we introduce an estimator for the MAPS of the cross-correlation signal and derive 
its statistical properties. In particular, we present a detailed analysis of the noise for the cross- 
correlation estimator. In Section 5 we present several observational considerations which are 
relevant for the cross-correlation signal, some pertaining to the Ly-a forest and others to 
the redshifted 21-cm signal or both. Finally, in Section 6 we discuss the detectability of the 
cross-correlation signal and the BAO features. We have used the cosmological parameters 
{fl m h 2 , Q b h 2 , fi A , h, n s , a 8 ) = (0.136, 0.023, 0.726, 0.71, 0.97, 0.83) from Komatsu et al. (2009) 
throughout this paper. 



2 THE CROSS-CORRELATION SIGNAL 

We quantify the fluctuations in the transmitted flux J-{n, z) along a line of sight n to a quasar 
through the Ly-a forest using Sjr(n, z) = Tifi^z^jT — 1. For the purpose of this paper we 
are interested in large scales where it is reasonable to adopt the fluctuating Gunn-Peterson 
approximation (Gunn & Peterson, 1965; Bi & Davidsen, 1997; Croft et al., 1998, 1999) 
which relates the transmitted flux to the matter density contrast 5 as J 7 = exp[— A(l + 5) K ] 
where A and k are two redshift dependent quantities. The function A is of order unity (Kim 
et al., 2007) and depends on the mean flux level, IGM temperature, photo-ionization rate 
and cosmological parameters (Croft et al., 1999), while k depends on the IGM temperature 
density relation (McDonald et al., 2001; Choudhury et al., 2001). For our analytic treatment 
of the Ly-a signal, we assume that the measured fluctuations 5jr have been smoothed over a 
sufficiently large length scale such that it is adequate to retain only the linear term 5jr oc 5 
(Bi & Davidsen, 1997; Croft et al, 1998, 1999; Viel et al., 2002; Saitta et al, 2008; Slosar 
et al., 2009). The terms of higher order in 5 are expected to be important at small length 
scales which have not been considered here. 

We use 8t{ti,z) to quantify the fluctuations in T(n,z) the brightness temperature of 
redshifted 21-cm radiation. In the redshift range of our interest [z < 3.5), it is reasonable 
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to assume that 8t{ti, z) traces 5 with a possible bias (Bharadwaj & Sethi, 2001; Bharadwaj 
et al., 2001). The bias is expected to be scale dependent below the Jeans length-scale (Fang 
et al., 1993). Fluctuations in the ionizing background also give rise to a scale dependent bias 
(Wyithe & Loeb, 2007, 2009). Moreover, this bias is found to grow monotonically with z 
(Marin et al., 2009). However, numerical simulations (Bagla et al., 2009; Guha Sarkar et al., 
2011), indicate that it is adequate to use a constant, scale independent bias at the large 
scales of our interest. 

With the above mentioned assumptions and incorporating redshift space distortions we 
may express both Sjr and St as 

S a (n, z)=C a J -j^L e* k --[l + A*/i 2 ]A(k) . (1) 

where a = J 7 and T refer to the Ly-a forest and 21-cm signal respectively. Here r is the 
comoving distance corresponding to z, A(k) is the matter density contrast in Fourier space 
and n — k • n. We adopt the values Cjr = —0.13 and (3jr = 1.58 from Ly-a forest simulations 
of (McDonald, 2003), and C T = T x m b and f3 T = f/b for the 21-cm signal (Guha Sarkar 
et al., 2011). These values are for z = 2.5 which is the fiducial redshift in our analysis. We 
note that there are large uncertainties in the values of all the four parameters Ct, Cjr, (3? 
and (3jr arising from our poor knowledge of the state of the diffuse IGM and the systems 
that harbour bulk of the neutral hydrogen at z ~ 2.5. 

A QSO survey (eg. SDSS 1 ), typically, covers a large fraction of the entire sky. In contrast, 
a radio interferometric array (eg. GMRT 2 usually has a much smaller field of view ( ~ 1°). 
Only the overlapping region common to both these observations provides an estimate of the 
cross-correlation signal. We therefore use the limited field of view L x L ( L in radians) 
of the radio telescope to estimate the cross-correlation signal. Given this constraint, it is a 
reasonable observational strategy to use several pointings of the radio telescope to cover the 
entire region of the QSO survey. Each pointing of the radio telescope provides an independent 
estimate of the cross-correlation signal, which can be combined to reduce the cosmic variance. 

We have assumed that the field of view is sufficiently small (L « 1) so that curvature 
of the sky may be ignored. In the flat sky approximation the unit vector n along any line of 
sight can be expressed as h = m + 9, where rh is the line of sight to the centre of the field of 

— * 

view and 9 is a two-dimensional (2D) vector on the plane of the sky. In this approximation 
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it is convenient to decompose 5^(0, z) and 5t(9, z) into Fourier modes instead of spherical 
harmonics. We then have the Fourier components 

A a (U,*) = / #0e-* m ™ 5 a (0,z) (2) 

J-L/2 

where U is a two dimensional vector conjugate to 9. Vector U which represents an inverse 

— * 

angular scale, and also 9 are both perpendicular to the line of sight to the centre of the 
field of view m. It is useful to visualize A Q (U, z) as Fourier components of the signals on a 
plane perpendicular to the line of sight rh located at a comoving distance r corresponding 
to the redshift z. The redshift z, here conveys two different pieces of information, namely 
the distance along the line of sight and also the epoch where the HI signals originated. 
We introduce the multi-frequency angular power spectrum (MAPS, Datta et al. 2007) 

( A Q (U, z)A;(U', z + Az)) = L 2 5 vu , P a7 (U, Az) (3) 

where the indices a and 7 have the possible values a = T, J 7 and 7 = T, J 7 . The multi- 
frequency angular power spectrum Ptf(\J, Az) = Pjr T (XJ, Az) refers to the cross-correlation 
signal, while P^(U, Az) and Ptt(U, Az) refer to the respective auto-correlation signals. 
It should be noted that the multi-frequency angular power spectra refer to the signals at 
two different redshifts z and z + Az. It is useful to visualize these power spectra as the 
correlation of angular modes which are defined on two different planes at comoving distances 
r and r + Ar corresponding to z and z + Az respectively. Our entire analysis is performed 
in a narrow range of redshifts Az <C z. The power spectrum also depends on z, however 
the variation with Az is much faster as compared to the z dependence and the latter is 
not explicitly shown. The value of z for which the results have been shown is mentioned 
separately wherever required. 

We emphasize here that the power spectra P Q7 (U, Az) are directly related to observable 
quantities. In particular, the visibilities measured in radio interferometric observations of 
the redshifted 21-cm signal (Bharadwaj & Sethi, 2001; Bharadwaj & Ali, 2005) are directly 
related to A T (U, z) defined here. The multi-frequency angular power spectra P ai (XJ,Az) 
contain the entire three dimensional information through the U and Az dependence. The 
fluctuations in the signal in the transverse directions are analyzed using Fourier modes U 
which are equivalent to the angular multipoles in the flat sky approximation, whereas Az 
corresponds to a comoving separation along the radial direction. 

The statistical homogeneity and isotropy of the cosmological density fluctuations ensures 
that the power spectra defined above are real, though it is not apparent from equation (3). 
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Table 1. The conversions between length scales expressed in various units. 

It also follows that the power spectra are isotropic in U and depend only on U = |U|. The 
assumption that both 5jr and 5t are related to the matter fluctuations 5 allows us to express 
all the angular power spectra considered here in terms of the three dimensional matter power 
spectrum P(k). Following Datta et al. (2007), we have 

CO 

P aj (U, Az) = ^J dk\\ cos(A;||Ar) F a » P(k) . (4) 
o 

where Ar = cAz/H(z) is the radial comoving separation corresponding to the redshift 
separation Az, k = \Jk^ + (^-) 2 , /i = k\\/k. and 

F a M = C a C 7 [1 + f3 af i 2 } [1 + /V 2 ] (5) 

We may identify the angular mode U with the angular multipole £ as 2nU = £ whereby 
the power spectrum P a (U,Az) can be identified with the multi-frequency angular power 
spectrum (MAPS) C^(Az) = P a (U,Az) under the flat-sky approximation (Datta et al., 
2007). The approximation is known to be reasonably good on scales £ > 10 considered 
in our subsequent analysis. We shall use £ and U interchangeably to represent the inverse 
angular scale and similarly one may think of P a (U, Az) and Cf(Az) to both equivalently 
represent the angular power spectrum. 

Quasar surveys indicate that the redshift distribution of the quasars peaks in the range 
z = 2 to 3 (Schneider et al., 2005). For any given quasar, it is possible to estimate 5jr in only 
a small redshift range which is governed by the redshift of the quasar. The quasar's proximity 
effect excludes the region very close to the quasar. Large redshift separations are excluded to 
avoid contamination from other spectral lines. Given these considerations, we have chosen 
z = 2.5 as the fiducial redshift for our analysis. For comparison, we have also shown several 
of the results at the neighbouring redshifts z = 1.5 and 3.5. In the subsequent discussion we 
shall interchangeably use the three dimensional wave number k, the two dimensional Fourier 
mode U and the angular mode £ to refer to inverse angular scales on the sky. Similarly, it may 
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Figure 1. The LCDM linear power spectrum normalised to Z = 0. The correspondence between t and k is shown for three 
different redshifts. For a fixed £, all the Fourier modes to the right of the corresponding k value contribute to P c (£,Az). 
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Figure 2. The transverse angular power spectra Pjr T (l) and Pt(&) °f th e cross-correlation and 2f-cm auto-correlation signals 
respectively shown at z = 1 .5 , 2.5 and 3.5. 



sometimes be convenient to express the redshift separation Az in terms of a radial separation 
Ar, or a frequency separation Au = 1420Az/(l + 2) 2 MHz (for 21-cm observations), or a 
velocity separation Av\\ = cAz/(l + z) (for Ly-a forest). We have indicated the conversion 
between these various possibilities in Table 1. 



2.1 Transverse Angular Power Spectrum 

We first consider Pjr T (£) = Pjr T (U, Az) for Az = 0. The Ly-a forest and the 21-cm signals, 
here, both lie on the same plane transverse to the line of sight m. We refer to Pjtt(^), 
which only contains 2D information, as the transverse angular power spectrum of the cross- 
correlation signal. This has contribution from all the three dimensional Fourier modes k 
whose projection on the transverse plane matches E/r = 2irU jr. Thus, all 3D Fourier modes 
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k > tjr contribute to Pjr T (£). Figure 1 shows the 3D matter power spectrum P(k) for the 
LCDM model with WMAP 5 cosmological parameters Komatsu et al. (2009). The corre- 
spondence between k and £ is shown for redshifts z = 1.5, 2.5 and 3.5. For a fixed £, all the 
Fourier modes to the right of the corresponding k value contribute to Pjr T (£). 

The detectability of the cross-correlation signal is crucially dependent on the amplitude 
of Pjr T (£). This amplitude depends on Ct and CV, whose values are highly uncertain. Given 
these uncertainties it would be very difficult to interpret the amplitude if measured, quan- 
titatively in terms of the physical properties of the IGM. On the contrary it is easier to 
relate the shape of Pft(^) to the matter power spectrum and the comoving distance r. We 
elucidate this by considering a simple toy model for the matter power spectrum. In this 
model we have P(k) = Ak for k < k eq , and P(k) = Ak eq (k / k eq )~ 3 , for k > k eg . where k eg is 
the Fourier mode that enters the horizon at the epoch of matter-radiation equality. Simpli- 
fying eq. (4) by ignoring the effect of redshift space distortion, Ptt{£) is the 2D projection 
of the 3D matter power spectrum P(k) onto the plane transverse to the line of sight. For 
large values of £ (> k eq r) we have a power law PjtW ~ ACTCjr/n£ 2 . The angular power 
spectrum flattens out around £ = k eq r, which corresponds to the peak in the matter power 
spectrum. The position of this feature scales with redshift as £ oc r. At the low £ (< k eq r), we 
have Pjft ~ AC T Cjr/2irr 2 which is independent of £. We see that there is an enhancement 
of power at low £ (large angular scales ). This is a generic feature of projection from 3D to 
2D (Kaiser & Peacock, 1991). 

Figure 2 shows Pjr T (£) for the LCDM model. For comparison we have also shown the 
corresponding 21-cm auto-correlation angular power spectrum at the same probing redshifts. 
The Lj-a auto-correlation power spectrum is expected to be dominated by the Poisson noise 
arising due to the discrete QSO sampling, and this is not shown here. In addition to the 
fiducial redshift 2.5 we have also shown z = 1.5 and 3.5 with the intent of displaying 
how the shape of the angular power spectrum changes with z. As mentioned earlier, the 
amplitude and also its z dependence are relatively uncertain. To estimate the amplitude we 
have assumed that Cjr and f3jr do not change with z while we have calculated Ct and (3t 
using the z dependence from Guha Sarkar et al. (2011). We see that the shape of Ptt^P) 
predicted for the LCDM model is in reasonable agreement with our simple toy model. At 
large £ (> 1000) we have Pjr T (£) oc £~ 1 - 76 as compared to Prr{(-) oc £~ 2 for the toy model. 
This discrepancy arises because the actual matter power spectrum is shallower than k~ 3 for 
k > k eq . The LCDM predictions and the toy model are in close agreement at small £ where 
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Figure 3. The decorrelation function k^(Az) for different values of I at our fiducial redshift z = 2.. 
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Figure 4. The decorrelation function Kf(Az) for £ = 300 and z = 2.5, showing a closeup of the region where the cross-correlation 
signal is anti-correlated. 

Ptt{£) is nearly independent of i. The I value where the flattening occurs is also found to be 
consistent with the predictions of the toy model. The behaviour of the 21-cm auto-correlation 
angular power spectrum is very similar to that of the cross-correlation signal. 

2.2 Radial decorrelation function 

Here we fix I (or equivalently U) and study Pj?t{U, Az) as a function of Az. This considers 
the correlation between the signals Ajr(U, z) and Ar(U, z + Az) which refer to the same 
angular mode U but are respectively located on two different planes that are separated by a 
comoving distance Ar. The cosine term in eq. (4) arises from the fact that a single 3D mode 
projects onto the two different planes with a phase difference of fcn Ar . This oscillatory cosine 
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Figure 5. For two values of i (100 and 5000), this shows how the decorrelation function (Az) varies with z. 



term ensures that the radial correlation Pjft(^, A2:) decreases with increasing Az. Ignoring 
redshift space distortion, we have Pj?r{£, Az) oc J °° dk\\ cos(k\\ Ar) P(^ k 2 + (i/r) 2 ). This 



integral is very close to zero if P(^Jk 2 + (i/r) 2 ) is nearly constant over the range k\\ = 
to k\\ = 27r/Ar which corresponds to one oscillation of the cosine term. We thus expect 
Pjr T (£ 7 Az) to be maximum at Ar = 0, decrease with increasing Ar and remain close to zero 
for Ar larger than Ar ~ l-nrjt (Bharadwaj & Pandey, 2003). We quantify this behaviour 
using the decorrelation function (Datta et a!., 2007) 

= PMe) (6) 
which has value K,g(Az) = 1 at Az = 0, and varies in the range < |«^(Az)| < 1. Figure 3 
shows k>i(Az) for different t values at the fiducial redshift z = 2.5. We see that Kf(Az) falls 
with increasing Az and crosses zero beyond which the signal is anti-correlated (k^(Az) < 0). 
We define the "decorrelation length" Azo.i as the redshift separation where the decorrelation 
function falls to 10% of its peak value (ie. Ki(Az 01 ) = 0.1) . The signals Ajr(XJ,z) and 
At(U, z + Az) are weakly correlated beyond the corresponding radial separation Aro.i- The 
decorrelation length, we find, decreases with increasing i ie. the signal decorrelates faster 
at smaller angular scales. This relation is well fitted by the relation Azq.i oc £~ - 76 for the 
entire I and z range considered here. A similar behaviour has been reported earlier for the 
21-cm auto-correlation signal (Bharadwaj & Pandey, 2003). 

The signal crosses zero at Az > Azq.i, beyond which it is anti-correlated. For the par- 
ticular value I = 300, Figure 4 shows a magnified view of the region where the signal is 
anti-correlated. The behaviour is similar for other values of I where the value of Kg(Az) 
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Figure 6. The linear matter power spectrum P(fc), which has the BAO features, has been divided by P(k) nw the "no wriggles" 
power spectrum (Eisenstein & Hu, 1998). The corresponding t values have been shown for z = 1.5, 2.5 and 3.5. For a fixed I. 
all the Fourier modes to the right of the corresponding k value contribute to Pjr T (l, Az). 



varies is in the range —0.02 to —0.03 in this region. The value of Ke(Az) remains small, and 
oscillates around zero for very large Az (not shown in the figures). 

The decorrelation function is sensitive to the redshift being probed. Figure 5 shows 
Ke(Az) for three different redshifts. We find that though the generic features remain the 
same, the decorrelation length Az .i increases with increasing redshift for all values of £. 

We note that the Kg(Az) for the 21-cm auto-correlation has a behaviour very similar to 
Ke(Az) of the cross-correlation signal discussed here. In fact, the difference between these 
two is less than 1%. This small difference arises because of the difference in the values of fij? 
and Pt- The behaviour of the decorrelation function Ki(Az) is sensitive to the cosmological 
parameters and this can be used to observationally determine the cosmological parameters 
(Bharadwaj et al., 2009), however we do not discuss this here. We note here that the distinct 
behaviour of Ke(Az) for the 21-cm auto-correlation allows one to distinguish the cosmological 
signal from astrophysical foregrounds (Ali et al., 2008; Ghosh et al., 2010, 2011). 



3 THE BARYON ACOUSTIC OSCILLATIONS 

The characteristic scale of the BAO is set by the sound horizon s at the epoch of recombi- 
nation given by 

c.(a) A 
Ja a 2 H(a) 

where a r is the scale factor at the epoch of recombination and c s is the sound speed given 
by c s (a) = c/ a/3(1 + 3pfc/4p 7 ), where p& and p 7 denotes the photon and baryonic den- 
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I 

Figure 7. This shows the imprint of the BAO on the transverse angular power spectrum Pj?t(£) for the cross-correlation 
signal. To highlight the BAO we have divided Pjr T (£) by P^,(£) which corresponds to P(k) nw . This is shown for three 
rcdshifts z = 1.5,2.5 and 3.5. 

sities respectively. The comoving length-scale s defines a transverse angular scale 9 S = 
s[(l + z)Da{z)}~ 1 and a radial redshift interval Az s = sH(z)/c, where Da(z) and H(z) are 
the angular diameter distance and Hubble parameter respectively. The comoving length- 
scale s = 143 Mpc corresponds to 9 S = 1.38° and Az s = 0.07 at z — 2.5. Measurement of 9 S 
and Az s separately, allows the independent determination of Da{z) and H(z). The CMBR 
angular power spectra maps the density fluctuations on the plane of the sky at z ~ 1000. 
Hence, the imprint of BAO in the form of acoustic peaks in the CMBR anisotropy angular 
power spectrum is only in the transverse direction and constrains 6 S . Low redshift galaxy 
surveys contain both the transverse and the radial BAO peaks. However, the difficulties 
in probing large radial distances leads to small survey depths. Moreover, a large shot noise 
contribution degrades the SNR making it very difficult to independently measure Da(z) and 

1 /3 

H(z). Typically, the combination [(1 + z) 2 D\(z)cz / H \z)] is measured instead in galaxy 
redshift surveys (Eisenstein et al., 2005; Percival et al., 2007). The multi-frequency angu- 
lar power spectrum discussed here allows us, in principle, to measure the BAO imprint in 
the both transverse and the radial directions. Large bandwidth radio observations covering 
a large portion of the sky, along with high density quasar surveys with spectra measured 
at high SNR would allow a detection of the BAO feature in both radial and transverse 
directions. The different sensitivities of the measured quantities Da{z) and H(z) on the 
cosmo logical parameters would ensure breaking of degeneracies in the parameter space. For 
example D&{z) shall constrain curvature more efficiently than H(z). Further, independent 
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measurement of D A (z) and H(z), would allow an internal cross check thereby making pa- 
rameter estimation self consistent. 

The BAO manifests itself as a series of oscillations in the linear matter power spectrum 
(Eisenstein & Hu, 1998). We focus on the first BAO peak which has the largest amplitude. 
This is a ~ 10% feature in P(k) at k m 0.045 (Figure 6) which corresponds to £ 270 at 
z = 2.5. We also see that the first peak will shift to lower £ at smaller redshifts. 

Figure 7 shows the BAO feature in the transverse angular power spectrum of the cross- 
correlation signal Pjr T (£). The BAO, here, is seen projected onto a plane. The BAO appears 
as a series of oscillations in Pjr T (£), the positions of the peaks being consistent with £ ~ k/r. 
The amplitude of the first oscillation in Ptt{£) is around 1%, in contrast to the ~ 10% 
feature seen in P(k). This reduction in amplitude arises due to the projection to a plane 
whereby several 3D Fourier modes which do not have the BAO feature also contribute to 
the £ where the first BAO peak is seen. At z — 2.5 the first peak occurs at £ ~ 270 and it 
has a full width of A£ ~ 200. The position £ and width A£ of the peak both scale as r if 
the redshift is changed. It is, in principle, possible to determine Da(z) by measuring the £ 
position of the first BAO peak. 
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We next consider the imprint of the BAO on the cross-correlation signal along the radial 
direction. This is quantified, as before, through the radial decorrelation function Ke(Az). We 
expect the first BAO peak to have an imprint on Kg(Az) at only the angular modes £ which 
are less than k±r, where k\ refers to the position of the first peak in the 3D matter power 
spectrum. Thus, for z = 2.5 we do not expect the first peak to have any impact on Ke(Az) 
at £ > 500. We expect the first BAO peak to have an impact at all angular modes £ < 500. 
The relative contribution is expected to be maximum around £ ~ 270, and then fall off at 
£ < 100 where it will be diluted by the k modes which are smaller than k\ and hence do not 
contain the BAO signal. Figure 8 shows the radial decorrelation function Ke(Az) for four 
different values of £ (= 100,250,300,500). For comparison, we also show K e (Az) calculated 
using P nw (k) which does not have the BAO features. 

We recollect that ki(Az) is maximum at Az = where Ki(Az) — 1. In the absence of the 
BAO features the value of Kg(Az) falls rapidly with increasing Az. This is followed by a Az 
range where Kg(Az) is negative, beyond which it oscillates around 0. The BAO, we find, has 
little impact on the Az range near Az = where k^{Az) is positive. The BAO features are 
found to have a very significant effect at large Az, typically in the range Az = 0.04 to 0.16, 
where Kg(Az) is mainly negative. The corresponding 21-cm frequency and radial velocity 
intervals are Av = 4.64 - 13.9 MHz and Av\\ = 3.5 x 10 3 - 10.3 x 10 3 km/s respectively. 
The imprint of the BAO appears as a ringing feature around the smooth 'no wriggles' 
Ki(Az). The ringing feature due to the BAO is quite distinct from the slow oscillation of 
Ke(Az) which is also present in the no-wiggles model. The value of Kt(Az) is quite small 
(~ —0.01) in the Az range where the BAO features occur. The deviation due to the BAO, 
however, could be as large as 40% to ~ 100% relative to the no-wiggles model. 

The position of the BAO feature shifts to smaller Az values if £ is increased. For a fixed 
£, we use A^bao to denote the position corresponding to the maximum deviation from the 
no-wiggles model. The value Az s = H(z)s/c, introduced earlier, corresponds to A^bao at 
£ = 0. It is not possible to directly determine Az s from ki(Az). It is However, it is in 
principle possible to determine H(z) by measuring Az B ao & t different £ values. 

4 THE CROSS-CORRELATION ESTIMATOR 

In this section we construct an estimator for Pjr T {U, Az), and consider the statistical proper- 
ties of this estimator. First, we assume that both the Ly-a forest and the 21-cm observations 
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are pixelized along the z axis into pixels or channels of width Az c such that both 5j?(9, z) 
and St{9,z) are measured only at discrete redshifts z n = zo + nAz c with n — 1,2, ...,N C . 
Here z is a reference redshift, N c is the total number of channels and N c Az c = B is the 
total redshift interval or the bandwidth spanned by the observations. 

Considering first the Ly-a forest, we have, till now, considered 5^(0, z n ) as a continuous 
field defined at all points on the sky. In reality, it is possible to measure this only along 
a few, discrete lines of sight where there are background quasars. We account for this by 
defining 5jr (9,n). the observed fluctuation in the transmitted Ly-a flux, as 

Sjr o (0, n) = p{9) [ Sjr(6, z n ) + 5M9, n)} (8) 

where 5^(9, z) is the contribution from the pixel noise in the quasar spectra and 

is the quasar sampling function. Here a = 1,2, ...,N refers to the different quasars in the 
L x L field of view, 9 a and w a respectively refer to the angular positions and weights of the 
individual quasars. We have the freedom of adjusting the weights to suit our convenience. It 
is possible to change the relative contribution from the individual quasars by adjusting the 
weights w a . 

The quasar sampling function p{9) is zero everywhere except the angular position of the 
different quasars. It is sometimes convenient to express the noise contribution in eq. (8) as 

p{9) 5M9, n) = *^-&) W£,n) (1Q) 

where Sjr^(9 a ,n) refers to the pixel noise contribution for the different quasars. The faint 
quasars typically have noisy spectra in comparison to the bright ones. We can take this into 
account and choose the weights w a so as to increase the contribution from the bright quasars 
relative to the faint ones, thereby maximizing the SNR for the signal estimator. For the 
present analysis we have made the simplifying assumption that the magnitude of Sjr^(9 a ,n) 
is the same across all the quasars irrespective of the quasar flux. We have modelled 8^(9 a , n) 
as Gaussian random variables with the noise in the different pixels being uncorrelated ie. 

(11) 

where ajr N is the variance of the pixel noise contribution. It is appropriate to use uniform 
weights w a = 1 in this situation. Such an assumption is justified in the situation where there 
exist high SNR measurements of the transmitted flux for all the quasars. 

Considering next the sampling function p(9), we assume that the quasars are randomly 
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distributed with no correlation amongst their angular position, and the positions also being 
unrelated to 5jr and 5t- In reality, the quasars do exhibit clustering (Myers et al., 2007), 
however the contribution from the Poisson fluctuation is considerably more significant here 
and it is quite justified to ignore the effect of quasar clustering for the present purpose. The 
Fourier transform of p{9) then has the properties that 

(p(U)> = 5 Ui0 (12) 

and 

(p(U0p(U 2 )) = 1 5 UliU2 + (l - 5 Ulfi d V2 , (13) 

which we shall use later. In the subsequent analysis, we also assume that N ^> 1 whereby 
(1 - l/N) « 1. 

We use Aj- (U, z) to denote the Fourier of 5j? (9, z). Using eq. (8) we have 

A^(U, z) = p(U) ® [A^(U, z) + A^(U, z)\ (14) 

where ® denotes a convolution defined as 

p(U) ® A^U, z) = ± P(U - U')A T (U>, z) (15) 

u> 

Using eqs. (3), (10), (11), (12), (13) and (14) we calculate the following statistical prop- 
erties of Aj- , 

(A^ o (U,n)) = (16) 

and 

(Ajr (V u n)A*jr (V 2 ,m)) = <f Ul)Ua 



-Lp^Ui, (n - m)Az c ) 



U 



(17) 



It is possible to simplify the sum over U using Parseval's theorem whereby 

^ PMU, (n - m)Az c ) = &((n - m) Az c ) , (18) 
u 

— * — * 

where ^{Az) = (5jr(6 a , z)Sjr(9 a , z + Az)) is the one dimensional (ID) correlation function of 
the fluctuations in the transmitted flux along individual quasar spectra. The ID correlation 
£jr(Az), or equivalently £f(i>ii), is traditionally used to quantify the Ly-a forest along quasar 
spectra, and this has been quite extensively studied (Becker et al., 2004; Coppolani et al., 
2006; D'Odorico et al, 2006). 
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Using fiQ = N/L 2 to denote the quasar density on the sky, we have 

(A To (U 1 ,n)A* Fo (U 2 ,m))=5 lJul j 2 L- 2 P^ (U 1 ,n-m) (19) 

where 

Pra,(U,p) = P TT (XJ,pAz c ) + — [ir{pAz c ) + (20) 

Ideally, we would expect Pjr To {\J,p) to provide an unbiased estimate of Pj-(XJ : pAz c ). The 
discrete quasar sampling, however, introduces an extra term ^jr{pAz c ) /fiq. The other term 
SpfiO'j^/fiQ arises due to the pixel noise, and it contributes only when we correlate a channel 
with itself. Though assumed to be a constant across the redshift range considered here, we 
note that fiq is infact a function of the magnitude limit of the survey through the luminosity 
function 7^(2) (Jiang et al., 2006) which accounts for the variability of quasar luminosities. 

Radio interferometric observations directly measure A T (U,z n ). We consider a radio- 
interferometric array with several antennas, each of diameter D. The antenna diameter and 
the field of view L are related as X/D ~ L, where A is the observing wavelength. Each pair 
of antennas measures A T (U,z n ) at a particular U mode corresponding to U = d/A, where 
d is the antenna separation projected perpendicular to the line of sight. The baselines U 
corresponding to the different antenna pairs are, in general, arbitrarily distributed depending 
on the array configuration. The observed HI fluctuation A To (U, n) at two different U values 
are correlated if | Ui — U2 |< 1/L. It is possible to combine the baselines where the signal 
is correlated by binning the U values using cells of size L^ 1 x L^ 1 . We then have the binned 
baselines at U = (n x i + n y j)/L ( Tl X , Thy £1X6 integers) which exactly match the Fourier modes 
of the Ly-a signal. The HI signal at different U values are now uncorrelated. We then have 

A To (U, n) = A T (U, z n ) + A TN (U, n) (21) 

where A™ is the corresponding noise contribution. The noise in different channels and 
baselines is uncorrelated, and 

(A To (Ui,7i)A£ (U 2 ,m)> = 5 l]u u 2 L 2 P TTo (U 1 ,n-m) (22) 

where 

Ptto(U, P ) = P TT (U,pAz c ) + 5 Pi0 N T (U) . (23) 

Here Nr(U) is the noise power spectrum. For a single polarization and a single baseline, this 
is given by 

Nt(u) = f_i_) W| (24) 
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where T sys is the system temperature, Az/ C the frequency interval corresponding to Az c , At 
the integration time and V(6) is the normalised power pattern of the individual antennas 
(Chengalur et al., 2007). The exact value of the ratio of the two integrals in eq. (24) depend 
on the antenna design. It is convenient here to express eq. (24) as 

N ^OTTOT (25) 

where N po i is the number of polarizations being used, M(U) the number of baselines in the 
particular cell corresponding to U, and x is a factor whose value depends on the antenna 

— * 

beam pattern V{9). For the purpose of this paper it is reasonable to assume a value \ — 0-5. 
We have 

(i [Ajr (Ui, n) A^ (U 2 , m) + A> a (Ui, n) A To (U 2 , m)\) = <S Ul)Ua Ppr (Ui, n - m) (26) 
where 

Pjr ro (\J,p) = Pjr r (\J, P Az c ) (27) 

which can serve as an estimator for the cross-correlation signal. It is, however, possible to 
increase the signal to noise ratio by averaging over the entire redshift interval (or frequency 
band). We therefore define the estimator E(XJ,p) as 

E( ) = Eti \ [Ajr (n)Ay (m) + A> D (n)A ro (m)] 

Z^n=l Z^m=l "|n-m|,p 

The various terms in eq. (28) all refer to the same U value which is not explicitly shown for 
convenience of notation. Note that we shall adopt this convention of not explicitly showing 
U in several of the subsequent equations. 
The estimator has the property that 

(E(U,p)} = PMU, P Az c ) (29) 

ie. it is an unbiased estimator for the cross-correlation signal. We next consider the covariance 
(E(\J 1 ,p)E(\J 2 ,q))-(E(lJ 1 ,p)) (E{U 2 ,q)) which, we find, is zero if Ui ^ U 2 . We therefore 
only consider 

Cov(p, q) = ( AE(p) AE(q)) = (E(p) E(q)) - (E(p)) (E(q)) (30) 

where all the terms refer to the same U. 
We note that 

{[A To {n)A* To {m) + A»)A To (m)] [A To (s)A* To (t) + A* To (s)A To (t)}) = 
4 PTTo(n - m)P FTo {s -t)+2 P TTo {n - t)P TTo (s - m) 
+ 2P TFo (n-s)P TT o{t-m) (31) 
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whereby 

Cov{p, q) = ^= — (32) 

^ 2^,a,/3 2^im,n,s,t °\m-n\,p°\s-t\,q 

where the variable a' has value J 7 when a = T and vice- verse, and (3, (5' are defined in a 

similar way. We simplify eq. (32) in two steps where we first have 

n , v _ Ea l/3 Yl=x Yl=i P *Po(p + m-n) P a ,p> (m - n - q) 
C ° V(P ' q) ~ 8(N c -p)(N c -q) • (33) 

Next, assuming that p > q, we have 

1 f 1 Nc ~ p 
Cov(p, q) = g _ g ) | ^ _ p ) ( N c-P~ m ) [Papoip + m)P a ,p (m - g) 

+ Pafioip -Q + rn)P a ,p, {m)} + ^ [ P a!3o(p ~ m)P a ,p, (q + m) 

m=0 

+ P a po(p ~ q ~ m)P a/ p (m)] } (34) 

We use eq. (34) to calculate the covariance Cov(U,p, q) for the individual U values. The 
expected signal is statistically isotropic in U, and it is useful to bin the estimates of the 
power spectrum. We have the 'binned' estimator 

Eb(Ub , p) = a g*SM ( 35, 

where Wi refers to the weights assigned to the estimates at different Uj, and U B refers to 
the average baseline of the particular bin 

Ub ~ ■ (36) 

The binned estimator has a covariance 

a "" f " M)= ^E~w^ • (37) 

where the parameter N poin refers to the number of independent pointings. This parameter is 
introduced to allow for the possibility that we are combining estimates of the power spectrum 
from observations in N poin independent parts of the sky. It is quite clear from eq. (37) that 
this helps to increase the signal to noise ratio as 1/ ^N poin . In subsequent parts of this paper, 
we have used eq. (37) to predict the noise in observations to measure Pjr T (U, Az). 



5 OBSERVATIONAL CONSIDERATIONS 

The quasar redshift distribution peaks in the range z = 2 to 3 (Schneider et al., 2005), and 
for our analysis we only consider the quasars in this redshift range. For a quasar at a redshift 
zq, the region 10, 000 km s -1 blue-wards of the quasar's Ly-a emission is excluded from the 
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Ly-a forest due to the quasar's proximity effect. Further, only the pixels at least 1,000 
kms -1 red-ward of the quasar's Ly-/3 and O-VI lines are considered to avoid the possibility 
of confusing the Ly-a forest with the Ly-/3 forest or the intrinsic O-VI absorption. For a 
quasar at the fiducial redshift zq = 2.5, the above considerations would allow the Ly-a forest 
to be measured in the redshift range 1.96 < z < 2.39 spanning an interval Az = 0.43. 

We consider redshifted 21 cm observations of bandwidth B = 128MHz covering the 
frequency range 355 MHz to 483MHz which corresponds to the redshift range 1.94 < z < 3 
with bandwidth B — 1 in redshift units. The Ly-a forest of any particular quasar will be 
measured in a smaller interval Az « 0.4 which is the deciding factor for the cross-correlation 
signal. Thus, only a fraction (approximately 40%) of the total number of quasars in this 
redshift range 2 to 3 will contribute to the cross-correlation signal at any redshift z. We 
incorporate this in our estimates by noting that Uq in eq. (20) refers to only 40% of all the 
quasars in the entire z range 2 to 3. 

Cosmic variance is a limiting factor for measuring the BAO signal. We first present a 
preliminary analysis to determine the observational considerations so that the first BAO 
peak is above the cosmic variance. This depends on Nk the number of independent Fourier 
modes in the k range k a = 0.03 Mpc -1 to k b = 0.07 Mpc -1 which covers the first BAO peak 
(Figure 6). We have 



where V is the observational volume. For observations covering the solid angle f2 and the z 
range 1.94 to 3, we have N k = 2.38 x 10 5 Q. The BAO peak is a 10% feature in P(k) and it is 
necessary to divide the interval k a to fc& into N bin bins in order to identify this feature. An N a 
detection of the first BAO peak requires that the uncertainty in the observed power spectrum 
AP(k)/P(k) = \/N hin /N k should be less than l/(10iV CT ). We then have an estimate of the 
solid angle that needs to be observed for a N a detection 



We would like a single field of view of the radio-interferometer f2 = L 2 to be large enough 
so as to cover this solid angle, whereby we need 



A 3 — a detection with N^ n = 5 requires a field of view L = 8° which can be achieved if 
we have a radio-interferometric array where the individual antennas are around D = 5 m 




(38) 




(39) 




(40) 
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Figure 9. The data points shows the binned transverse angular power spectrum Pjr T [t) for the cross-correlation signal at 
z = 2.5 with 1 — a error bars. The error bars correspond to the set of observational parameters (see text) that give a 5 — a 
detection of Pjr T (l). 

in diameter. It is advantageous to have a larger field of view, and we consider antennas of 

diameter D = 2m which gives a L = 20° field of view for which a 7.7 — a detection is possible 

if Nbi n = 5. Note that this sets the upper limit for the signal to noise ratio (SNR) that can 

be achieved in a single pointing. In the next section We present more detailed estimates of 

the SNR, taking into account various factors like the discrete quasar sampling and the noise 

in the quasar spectra and the radio data. 

We next discuss the array layout that would be required for these observations. Using 

d/X = U = kr/2ir, we estimate that the Fourier modes k a and kt> correspond to antenna 

separations d a = 15m and df, = 36m respectively. These figures roughly set the range of 

antenna separations that would be required in the radio-interferometric array. Based on 

these considerations we consider a radio interferometric array which has N ann antennas 

distributed such that all the baselines d within d max = 50 m are uniformly sampled, whereby 

M(U) is independent of U and we have M(U) « 4 (iV ann /100) 2 . Using this in eq. (25), 

assuming T sys = 100 K, N po i = 2, x — 0.5 we have 

, r TAl2 / 100 \ 2 / 100 KHz\ / 1000 hrs\ . . 

N T = 1.0 x 10- [ruKf (_) (_) (_) . (41) 

6 DETECTABILITY 

In the previous section we have discussed several observational considerations which essen- 
tially determine the redshift interval and angular scales that need to be covered in order 
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Figure 10. Contours of SNRi, the signal to noise ratio for detecting the angular power spectrum in a single field of view. 
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Figure 11. The variation of SNRi with N T for n Q = 6.4 dog" 2 and 25.6 dcg~ 2 corresponding to the BOSS and BIGBOSS 
respectively. We also show | ^f^jf^r I which gives an estimate of how SNRi scales as Nt is reduced. This is expected to 
have a value 0.5 for large Nt and drop to as Nt — > 0. We find that the SNRi oc Nt~ 0,5 scaling is approximately valid for 
Nt > 10 -4 mK 2 , and SNRi falls slower than this for smaller values of Nt- 

to detect the imprint of the BAO in the cross-correlation signal . We have seen that obser- 
vations covering the redshift interval from z = 2 to 3 using a radio interferometric array 
with antennas of diameter 2 m each are well suited for this purpose. Further, for the purpose 
this paper, we have made the simplifying assumption that the array layout is such that it 
uniformly samples all the baselines within 50 m. Given this observational framework, we now 
estimate the required observational sensitivity for a detection of the BAO signal. 

The variance of the cross-correlation estimator Cov(p,p) = ( AE(p) AE(p) ) is a sum of 
several terms (eq. 34) of the form Pj^rip + m ) Ptt(p — m ) and Ptto(p + m ) PttJ^P ~ m )i 
where P?t refers to the cross-correlation signal that we are trying to detect. The terms 
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Figure 12. This shows the first BAO peak in the transverse angular power spectrum Ptt(P) for the cross-correlation signal. To 
highlight the BAO we have divided Pjr T (l) by PjV^if) which has no- wiggles. The binned data points and error bars correspond 
to 1 — cr for the set of observational parameters (see text) which give a 5 — a detection of the transverse BAO. 
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Figure 13. This shows Aki(Az), the difference between the BAO and no-wiggle models for the cross-correlation signal at the 
I bin centered at i = 230. The binned data points and error bars correspond to 1 — it for the set of observational parameters 
(see text) which give a 5 — a detection of the radial BAO. 

Ptto and Ptto refer to the auto-correlation of the Ly-a and redshifted 21-cm observations 
respectively. Note that Pjto arid Ptto both have contributions from observational noise, and 
hence they differ from the respective cosmological auto-correlations signals Ptt and Ptt- 

We first consider Pjj a which refers to the Ly-a forest. In addition to -Pjfjf, this also has 
a contribution from noise that arises due to the discrete QSO sampling. This noise, which 
arises from the discrete sampling of quasars, is proportional to Uq 1 (eq. 20). To recapitulate, 
fiQ refers to approximately 0.4 times the total angular density of quasars in the redshift 
range 2 to 3. This noise also depends on £jr for which we have used the values from Becker 
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et al. (2004). Further, we have assumed that the Lyman-o; forest spectra are all measured 
with a high sensitivity such that the pixel noise contribution has a value a^ N = 0.04. 

We next consider Ptto which refers to the redshifted 21-cm observations. In addition to 
the cosmological signal Ptt, this also has a contribution N T from the system noise (eq. 23). 
The quantity N T refers to the noise power spectrum of the radio interferometric observations. 
The value of N T depends on several observational parameters. In the present context, for 
observations with a fixed channel width Az/, Nt depends only on the number of antenna 
N ann and the observing time At (eq. 41). We see that there are many different combinations 
of N ann and At which will give the same value of Nt- 

In the present analysis we have used uq and Nt to parametrize the respective sensitivities 
of the QSO survey and the redshifted 21-cm observations We have Ptto — > Pjt and Ptto — > 
P T t in the limits Uq — > oo and Nt — >■ respectively. The limit uq — > oo and Nt — >■ sets 
the upper bound for the SNR, which also corresponds to the cosmic variance. The additional 
noise contributions arising from fiq and Nt will degrade the SNR to a value which is lower 
than the cosmic variance. In this work we would like to determine the range of uq and 
N T where it will be possible to detect the BAO feature. The entire subsequent discussion 
refers to the fiducial redshift z = 2.5. Though we expect all the quantities to vary across the 
redshift range z = 2 to z = 3, we anticipate that the values at z — 2.5 will be representative 
of the entire z range. 

We first consider the transverse angular power spectrum Pjr T (£) = Pj^t(U, Az) at Az = 
0. It is also possible to study the transverse angular power spectrum by considering the 
U dependence of Pjr T (U, Az) while holding Az fixed at some other value Az > 0. The 
signal, however, is maximum for Az = 0. Further, it is also possible to increase the SNR 
by suitably combining the estimates at different values of Az. However, we do not consider 
these possibilities here, and we only consider the transverse angular power spectrum at 
Az = 0. The £ range 125 < £ < 330, we have seen, is adequate for detecting the first BAO 
peak (Figure 7). We assume that the baselines U within this £ = 2n | U | range have been 
divided into 5 bins of equal AU, and we consider the results (eg. Figure 9) at the average 
£ values (eq. 36) corresponding to each of these bins. We have set Wi = 1 whereby all all 
the baselines within a bin contribute equally to the binned estimator (eq. 35). Here we have 
chosen a channel width of lOOKHz for the 21-cm data, and we assume that the Ly-a forest is 
smoothed at the corresponding velocity width of m lOOKms -1 . The cross-correlations signal 
starts to decorrelate (Figure 3) if the frequency channel width is larger. We have used the 
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results for the central £ bin, which roughly corresponds to the center of the first BAO peak, 
to assess the overall SNR. Note that in the subsequent discussion we use SNRi to refer to the 
signal to noise ratio for a single field of view, and SNR (or total SNR) to refer to the general 
situations where there are N poin (> 1) independent pointings of the 20° x 20° field of view. 
Figure 10 shows SNRi contours as a function of n Q and N T . We see that the SNRi does not 
increase very significantly beyond uq > 60deg~ 2 or < 10~ 7 mK 2 . We have the maximum 
value of the SNRi at the bottom right corner of Figure 10 where both of these conditions 
are satisfied. This corresponds to the uq — > oo and Nt — > limit, and the maximum SNRi 
, which corresponds to the cosmic variance, has a value SNRi =300. Note that the SNRi 
contours in Figure 10 refer to the detection of the cross-correlation signal Pj? T (£), and not the 
BAO which is just a 1% feature in Pjr T (£). We find that a 5 — a detection of Pft{£) is possible 
for fiQ ~ 0.1 deg" 2 and N T ~ 10~ 4 mK 2 . The required QSO density is well within the scope 
of present observational capabilities, for example the currently available SDSS (Schneider 
et al., 2005) has a total QSO number density of ~ 1 deg" 2 implying uq ~ 0.4 deg~ 2 which 
is in excess of the required QSO number density. The requirement that Nt ~ 10 _4 mK 2 . can 
be achieved in 1, 000 hrs if we have 300 antennas. A longer observation will be required if we 
have fewer antennas (eq: 41). Figure 7 shows the binned data points and the 1 — a error-bars 
that are expected in such an observation. 

The first BAO peak is a 1 % feature in Pjtt(£), and a 5 — o detection of the BAO peak 
requires an SNR of 500 for Pj T [t). It is not possible to reach an SNR greater than 300 with 
a single field of view (Figure 10), and it is necessary to consider multiple pointings. The 
total signal to noise ratio increases as 

SNR =^N~ SNRi, (42) 

when the number of independent fields of view is increased. We require 100 independent 
fields fields of view to detect the BAO with the currently available SDSS. This exceeds the 
angular area of the SDSS, and is not viable. It is necessary to consider a survey with a higher 
quasar density. The upcoming BOSS (McDonald et al., 2005) is expected to have a QSO 
density of 16 deg -2 which corresponds to n Q = 6.4 deg -2 . The BOSS 3 survey is expected to 
cover ~ 10, OOOdeg 2 of the sky, and we could ideally have N poin = 25 independent pointings 
of the 20° x 20° field of view. We see that, with BOSS, it is possible to achieve an SNR 
larger than 100 in a single field of view (Figure 10). Therefore, a 5 — a detection of the first 
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BAO peak is possible with N poin < 25, which is within the total angular coverage of BOSS. 
Figure 11 shows how SNRi varies with Nt for a single field of view of BOSS. We see that 

— 1/2 

the SNRi scales as N T for large Nt where the system noise in the radio observations 
is much larger than the HI signal. In this regime we have SNR oc \fN poin /N T it. relatively- 
shallow observations of a large number of fields of view, or deep observations of a few fields 
of view would both give the same SNR provided that the total observing time N poin At (eq. 
41) is the same in both of these situations. Ideally, it is most advantageous to work in this 
region. However, we see that the SNR oc A^ 1//2 scaling only holds at Nt > 10~ 4 mK 2 where 
the SNRi is pretty low (~ 40), and a 5 — a detection of the BAO peak is not possible 
within the total angular coverage of BOSS. It is necessary to consider observations that go 
deeper than 10~ 4 mK 2 . The increases in SNRi is slower than N^ 1 ^ 2 for N T < 10~ 4 mK 2 , 
and the SNRi saturates at Nt < 1CT 7 mK 2 where N T is much smaller than the HI signal. 
The range 10~ 5 mK 2 > N T > KT 6 mK 2 , where the SNRi is in the range 100 to 200, is 
relevant for detecting the BAO with the cross-correlation signal using BOSS. In this region 
it is optimal to increase the total SNR by increasing the number of fields of view instead 
of increasing the depth of the individual observations. We find that we have SNRi = 100 
for N T = 1.1 x 10~ 5 mK 2 , and a 5 — a detection of the first BAO peak is possible with 
Npdn = 25 fields of view. Alternatively, a 5.8 — a detection is possible with N poin = 16 and 
N T = 6.25 x 10~ 6 mK 2 . The scaling is approximately SNR oc N^ in N^ - 38 here, and the 
total observing time is smaller if we consider the shallower observations. Figure 12 shows 
the expected binned data points and error-bars for the detection of the first BAO peak. 

The BIGBOSS (Schlegel et al., 2009) has been conceived as the successor to the upcoming 
BOSS QSO survey. BIGBOSS may achieve a QSO density of ~ 64 deg~ 2 which corresponds 
to fiQ = 25.6 deg -2 . This is very close to the region where we have the cosmic variance limit, 
and the limiting SNRi of ~ 300 can be achieved if Nt ~ 2 x 10~ 6 mK 2 (Figure 10). For this 
value of Nt, a 5 — a detection of the first BAO peak is possible with only two fields of view. 
However this is not the optimal observational strategy. For BIGBOSS, the behaviour of the 
SNRi as a function of N T is very similar to that for BOSS, except that the SNRi values 
are 1.5 times larger (Figure 11). The range 3.3 x 10" 5 mK 2 > N T > 10~ 6 mK 2 , where the 
SNRi is in the range 100 to 300, is relevant for detecting the BAO with the cross-correlation 
signal using BIGBOSS. It is most advantageous to consider relatively shallow observations 
with Nt = 3.3 x 10~ 5 mK 2 , for which a detection is possible with N po i n = 25. The scaling 
here is SNR oc N po 5 in N^ 0A3 , which is quite close to the N^ ' 5 behaviour. 
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We next consider the detection of the radial oscillations. We have seen (Figure 8) that 
the radial oscillations occur in the Az range 0.04 to 0.14. where, in the absence of the 
BAO, the cross-correlation signal is anti-correlated ie. Ki(Az) < 0, and relatively weak with 
| Ke(Az) |~ 0.01 . The BAO introduces a ringing feature which is highlighted in Figure 
13 which shows An e (Az) which is the difference between the BAO and the the no-wiggles 
model. We see that the deviation could be as large as Ane(Az) ps 0.01. Therefore, the net 
effect of the radial oscillations is a ~ 1% deviation relative to Pjr T (£), which is comparable 
to the deviation introduced by the angular oscillations. Though the angular and the radial 
oscillations both introduce ~ 1% deviations relative to the no-wiggles model, we do not 
expect that every observation which is capable of detecting the angular oscillations will 
also be able to detect the radial oscillations. The radial oscillations occur at Az ps 0.1, 
and we have only (B/Az) ~ 10 independent estimates in our observation. In contrast, the 
tangential feature occurs at 9 S ps 1.38° and we have (20°/20°) 2 ~ 210 independent estimates. 
We thus expect the radial oscillations to have a lower SNR in comparison to the tangential 
oscillations discussed earlier. The BAO signal is maximum in the vicinity of t ~ 250, and 
we have collapsed the three central i (or U) bins in order to enhance the SNR for the radial 
oscillations. We find that this enhances the SNR by a factor of approximately y/2/7. The 
frequency channel width is maintained at Au = 100 KHz which is the same as the value used 
for the tangential oscillations. Note that the errors in the Pj?t(U, Az) values estimated at 
different Az will, in general, be correlated and a rigorous error analysis would require us to 
calculate the full covariance matrix (eq. 34). We have not used the covariance matrix in the 
present analysis, Instead, we have assessed the SNR for a detection of the radial oscillations 
by using just the single value at Az = 0.1 where the deviation from the no- wiggles model is 
maximum (Figure 13). We find that a 5 — a detection is possible with BOSS if we observe 
25 fields of view with N T = 6.25 x 10~ 6 mK 2 . We require N T = 1.7 x 10 5 mK 2 for a similar 
detection with BIGBOSS. 



7 SUMMARY AND DISCUSSION 

In this paper we have developed a theoretical formalism for estimating the cross-correlation 
signal between the fluctuations in the Ly-a forest and the fluctuations in the redshifted 21- 
cm emission from neutral hydrogen. Both of these quantities are measured as functions of 
frequency (redshift) and angular scale. Consequently, we have used the Multi-frequency An- 
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gular Power Spectrum (MAPS) to quantify the statistical properties of the cross-correlation 
signal. This deals directly with the observed quantities, and retains the distinction between 
the angular and the frequency information. This, as we shall elaborate shortly, is very im- 
portant in the light of foreground removal and continuum subtraction. 

Continuum fitting and subtraction is a very critical step in calculating 5jr for the Ly- 
a forest and several different methods have been proposed for handling this (Croft et al., 
2002; McDonald et al., 2006). Errors in continuum subtraction can be a serious problem 
for the Lj-a forest auto-correlation signal (Kim et al., 2004). The redshifted 21-cm signal 
is buried deep under astrophysical foregrounds which are several orders of magnitude larger 
(Shaver et al., 1999; Di Matteo et al., 2002; Santos et al., 2005; Wang & Hu, 2006; Ali et al, 
2008; Pen et al., 2009; Ghosh et al., 2010). Several different techniques have been proposed 
for separating the cosmological 21-cm signal from the foregrounds (Ali et al., 2008; Jelic 
et al., 2008; Bowman et al., 2009; Liu et al., 2009; Ghosh et al., 2010). Foreground removal 
is a rather severe problem for observations of the redshifted 21-cm auto-correlation signal. 
In addition to the 21-cm signal, the foregrounds make a very large contribution to the 
expectation value of the auto-correlation estimator. However, the foregrounds are believed 
to have a slowly varying, smooth frequency (or Az) dependence which is quite distinct from 
the signal which decorrelates rapidly with increasing Az (Figure 3). It is therefore possible 
to remove the foregrounds from the measured MAPS (-Pjto(^, Az) ) by subtracting out 
any component that varies slowly with Az. In fact, Ghosh et al. (2011) have recently used 
MAPS to analyze 610-MHZ GMRT observations and show that it is possible to remove 
the foregrounds from the m auto-correlation by fitting and subtracting out slowly varying 
polynomials in Av. 

We do not expect the continuum in the Ly-a forest to have any correlation with the 
foregrounds in the redshifted 21-cm observations, and consequently they will not contribute 
to the expected cross-correlation signal. The continuum and the foregrounds will, however, 
appear as extra contributions to the variance of the cross-correlation estimator. Since these 
contributions appear in the variance, it is possible to reduce these by combining different 
independent estimates of the cross-correlation. We can reduce the continuum and foreground 
contributions by combining estimates of the cross-correlation at different baselines U and 
different fields of view. The problem, therefore, is much less severe in comparison to the 
auto-correlation. Further, the continuum and the foregrounds are both expected to have a 
slowly varying frequency (or Az) dependence and it should be possible to remove these from 
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the measured MAPS by subtracting out the component that varies slowly with Az. We plan 
to perform a detailed analysis of these issues in future. 

Our study shows that it is possible to have a 5 — a detection of the imprint of the first 
BAO peak in the cross-correlation signal using BOSS, an upcoming QSO survey. For this, 
we have considered a radio interferometric array that covers the z range z = 2 to 3 using 
antennas of size 2 m x 2 m which have a 20° x 20° field of view. We find that we need to 
observe 25 fields of view, approximately the full angular coverage of BOSS, with a noise 
level of N T = 1.1 x lCT 5 mK 2 and N T = 6.25 x lCT 6 mK 2 in order to achieve a 5 — a 
detection of the angular and radial oscillations respectively. The corresponding noise levels 
are N T = 3.3 x KT 5 mK 2 and N T = 1.7 x 1(T 5 mK 2 for BIGBOSS whose quasar density is 
expected to be four times larger than that of BOSS. 

We now briefly discuss how it may be possible to carry out such observations. In our 
analysis we have made the simplifying assumption that the antennas are distributed such 
that all the baselines within 50 m are uniformly sampled. We consider an interferometric 
array with N ann = 400 antennas which roughly corresponds to the maximum number of 
2 m x 2 m antennas that can fit in a 50 m x 50 m region. We see that we need At = 5700 hr 
and 10, 000 hr of observation per field of view (41) to reach the noise levels required to detect 
the angular and radial oscillations respectively with BOSS. The corresponding figures are 
1900 hr and 3700 hr for the BIGBOSS. Note that we it is required to observe a single field 
of view for 8 hr a day for a whole year in order to achieve 3000 hrs of observing time. 

It is quite evident that we require to observe 25 fields of view, with 2 to 3 years of 
dedicated observations for each field, in in order to detect the BAO. Operating sequentially, 
considering one field after the next, the required observations would possibly run over a 
period of 50 yr to a century, which raises the need to consider alternative observational 
strategies. It is a viable possibility to have antennas that can simultaneously observe several 
independent fields of view. However, it is unlikely (if not impossible) to have antennas that 
can simultaneously observe 16 or 25 such 20° x 20° fields of view. For the purpose of this 
discussion, we assume that we have antennas that can simultaneously observe 4 fields of view. 
We then see that it would approximately require observations over a decade (or more) in 
order to detect the BAO. Another possibility is to have several radio interferometric arrays, 
each located at a different location and observing a different parts of the sky. Four to five 
separate arrays, each with 400 antennas, would be required to carry out these observations 
in the span of a few years. It is important to note that it may be possible to reduce the 
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observational requirements to some extent by optimally distributing the baselines instead 
of considering them to be uniformly distributed. We propose to investigate these issues in a 
future study. 

Observations of the BAO can be used to constrain the values of various cosmological 
parameters. The equation of state of the Dark Energy is particularly important in this 
context. In this paper we have mainly estimated the range of observational parameters for 
which it will be possible to detect the BAO using the cross-correlation signal. We plan 
to study a variety of issues including the optimal array configuration and cosmological 
parameter estimation in future. 
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